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Abstract 

We investigate the square variation operator (which majorizes the partial sum max- 
imal operator) on general orthonormal systems (ONS) of size N. We prove that the 
norm of the operator is bounded by 0(ln(7V)) on any ONS. This result is sharp and 
refines the classical Rademacher-Menshov theorem. We show that this can be improved to 
0{^yln{N)) for the trigonometric system, which is also sharp. We show that for any choice 
of coefficients, this truncation of the trigonometric system can be rearranged so that the 

norm of the associated operator is 0{^/hl\n{N)). We also show that for p > 2, a 
bounded ONS of size N can be rearranged so that the norm of the operator is at 
most Op (In ln(7V)) uniformly for all choices of coefficients. This refines Bourgain's work on 
Garsia's conjecture, which is equivalent to the V°° case. Several other results on operators 
of this form are also obtained. The proofs rely on combinatorial and probabilistic methods. 

1 Introduction 

Let T := [0, 1] denote the unit interval with Lebesgue measure dx and let $ :— {0„}„gN 
denote an orthonormal system (ONS) of real or complex valued functions on T. By an ONS, 
we will always mean the set of orthonormal functions {4>n}n<£N and the ordering inherited 
from the index set N. For / e L^, we let a„ = {f,4>n) denote the Fourier coefficients of / 
with respect to the system $. Associated to an ONS is the maximal partial sum operator 



Mf{x) := sup 

N 



N 

an(l>n{x) 

n=l 



It is well known that the boundedness of the operator M implies the almost everwhere 
convergence of the partial sums of the expansion of / G in terms of the ONS <&. Almost 
everywhere convergence is known to fail for some ONS, hence the maximal function M is 
known to be an unbounded operator on for some ONS. There is an optimal estimate 
known for general ONS. 

Theorem 1. (Rademacher-Menshov) Let {0n}neN = ^ and f G be as above. Then, 

(oo 
^|a„pln2(n- 
n=l / 

where the implied constant is absolute. Moreover, the function ln^(rt + 1) cannot be 
replaced with any function that is o{ln'^{n + 1)). 

This last claim is quite deep and is due solely to Menshov. 

While this estimate is optimal in general, it can be improved for many specific systems. 
For instance, the inequality ||A^/||l2 ^ ll/IU^ is known to hold when (f> is taken to be the 
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trigonometric, Rademacher, or Haar systems. We recall the definitions of these systems in 
the next section. 

Recently, variational norm refinements of the maximal function results stated above 
have been investigated. To state these results, we first need to introduce some notation. Let 
a — {onlJ^Li be a sequence of complex numbers. Then we define the r-variation as: 



lallvr lim sup > 



l/r 



riG/ 



where the supremum is taken over all partitions Vk of \K\ (i.e. all ways of dividing [A'] into 
disjoint subintervals) . When a is a finite sequence of length if, the quantity is defined by 
dropping the lim;f_j.oo. 

One can easily verify that this is a norm and is nondecreasing as r decreases. Now we will 
denote the sequence {a„0„(a;)}5^i by S\j\{x). (Note that this is slightly different than the 
notation used in jT^].) When we write ||S'[/]||v'r(x), we mean the function on T whose value 
at X € T is obtained by assigning the r-th variation of the sequence S\j\(x). Furthermore, 
||S'[/]||^p(yr) is the LP norm of this function. Alternately, we have 



1/2 



ii/iiy.(a:) = sup sup \y^\s^mx)~s,,.-Amt 

K no<-<"jf 



where \ = X^nLi o,n4'n{x) is the n/-th partial sum. 

We note that the function ||<5'[/]||v'~(a;) is essentially the maximal function. More pre- 
cisely, Mf{x) <^ \\S[f]\\v°°{x) ^ Mf{x). Since the quantity ||a||yr is nondecreasing as r 
decreases, we see that ||>5'[/]||y(a;) majorizes the maximal function whenever r < oo. In 
[12], the following is proved for the trigonometric system {e^'^*"^}^]^: 

Theorem 2. Let r > 2 and r' < p < oo, where ^ + p- — 1. Then 

\\S[f]\\L.ivn<Cp,r\\f\\L^^ 

where Cp^r is a constant depending only on p and r. 

This result is rather deep, being a strengthened version of the celebrated work of Carleson 
and Hunt on the almost everywhere convergence of Fourier series. The analogous inequal- 
ities were previously obtained in [2| in the simpler situation of Cesaro partial sums of the 
trigonometric system. Moreover, the above inequality is known to hold for the Haar system 
and more generally for martingale differences by Lepingles inequality, a variational variant 
of Doob's maximal inequality. In 12 , it is shown that the condition r > 2 is necessary in 
case of the trigonometric system. Our focus here will be to study the case p = r = 2 for 
general ONS. In this direction, we prove (closely following the classical proof): 

Theorem 3. Let ^ he an ONS. Then 

ll^[/]IU^(v^)« \y^W?\^\n + \)\ . (1) 

//||A^/||i2 ^ A(A'')||/||x,2 /or all f — X]^=i ^n4'n for some real valued function A{N), then 

1/2 



/ N \ 1/2 

« f^^A(n)ln(n-f l)|a„n . (2) 



Interestingly, the first inequality strengthens the Rademacher-Menshov theorem stated 
above, since the right sides are the same (up to implicit constants), yet we have replaced 
the maximal function with the square variation operator on the left side. Since the 
operator dominates the maximal operator, this implies the Rademacher-Menshov theorem 
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and the claim that this result is sharp follows from the sharpness of Rademacher-Menshov. 
This might lead one to think that the two operators behave similarly, however we will see 
that the V'^ operator is much larger than the maximal operator for the classical systems. 
Theorem [3] can be refined further for certain classes of ONS, see Section [7] for discussion of 
this. 

We can apply ^ to the trigonometric system with A(A^) — 0(1), the Carleson-Hunt 
inequality, and obtain the following corollary: 

Corollary 4. Let {e^^™^}^^]^ be the trigonometric system. We then have 



Moreover, the function \n{n + 1) cannot be replaced by a function that is o(ln(n + 1)). 

The lower bound can be obtained by considering the Dirichlet kernel Dn{x) = ^^=1 e^'^™^. 
A proof of this is contained in Section 2 of [12 . Strictly speaking, they work with the de la 
Vallee-Poussin kernel there, but the same proof works for the Dirichlet kernel. 

As we will see below, it is easy to construct an infinite ONS such that ||S'[/]||/,2(y2-) <C 
11/11^2 holds, by choosing the basis functions 0„(x) to have disjoint supports. However, this 
is a very contrived ONS, and it is then natural to ask if there exists a complete ONS such 
that ||S'[/]||i2(y2) <c 11/11^2. This is not possible. In fact, we show slightly more: 

Theorem 5. Let {</>„} be a complete orthogonal system. There exists a L°° function such 
that ||S'[/]||y2(a;) = oo for almost every x. 

In general, this divergence cannot be made quantitative. We show that for any function 
w{n) — ^ oo, there exists a complete ONS such that ||S'[/]||i2(y2-) ^ ti;(A^)| |/| 1^2 whenever 

/(^) = S^=i '^"'/'"(^)- However, a quantitative refinement is possible if we restrict our 
attention to uniformly bounded ONS: 

Theorem 6. In the case of a uniformly bounded ONS, it is not possible for w{N) = 
o(-\/lnln(iV)). However, there do exist uniformly bounded ONS such thatw{N) — 0(-\/ln In(Af)). 

The Rademacher system provides an example of the second claim. See Theorem[5] below. 

Recall that we defined an ONS to be a sequence of orthonormal functions with a specified 
ordering. This is essential since the behavior of the maximal and variational operators 
depend heavily on the ordering. For instance, the Carleson-Hunt bound on the maximal 
function for the trigonometric system makes essential use of the ordering of the system, and 
the result is known to fail for other orderings. It is thus natural to ask what one can say 
about the operator for reorderings of the trigonometric system. Surprisingly, it turns out 
that the 0{yJ\x\{N)) bound can be improved to 0{^\n In(iV)) for any choice of coefficients 
by reordering the system. More generally: 

Theorem 7. Let {(f)n}n=i ONS such that |(/)„(a;)| — 1 for all x and n, and let 

/(^) — Sn=i o,n4'n{x). Then there exists a permutation tt : [N] — > [N] such that 



holds (for sufficiently large N ) with respect to the rearranged ONS {V'n}^=i7 where ipn{x) ■= 

(l>^(n)ix)- 

This is perhaps the most technically interesting part of the paper. This result should 
be compared to Garsia's theorem [Z, which states that the Fourier series of an arbitrary 
function with respect to an arbitrary ONS can be rearranged so that the maximal function 
is bounded on L^. Garsia's proof proceeds by selecting a uniformly random permutation, 
and arguing that it will satisfy the claim with positive probability. In our case, however, 
we randomize over a subset of all permutations. This subset is chosen based on structural 
information about the Fourier coefficients of the function. It is unclear if this restriction 





||/|U2(V.2) « v/hU^^II/ll 
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is necessary or an artifact of our proof techniques. It would be interesting to extend this 
result to more general ONS. We note that it can be seen from the work of Qian [TB] (see also 
our refinement [IT]) that \\Yln=i'^ri\\L^{v^) > y/N\n In(iV) = ^In \n{N) || I]^=i ^"lU^, 
regardless of the ordering of the Rademacher functions r„, hence the •\/lnln(A^) term in the 
statement of the theorem is sharp. A similar result can be obtained for general ONS when 
the coefficients are multiplied by random signs: 

Theorem 8. Let {4>n}n=i '^'^ ONS and f{x) — '^n=i '^n4'n{x) ■ Then there exists a 
sequence of signs such that 



<m Vlnln(iV) \ \g\\L^ 

holds, where g{x) = Yln=i ^nan4>n(.x). 

This easily follows from the following inequality: 

Theorem 9. Let {j'nl^^i be a sequence of uniformly bounded independent random variables. 
Then 

N 




< ^J\n\n{N) 

In particular, combining this with Theorem we see that the L^ norm of the operator 
for the Rademacher system grows like -\/ln In(iV). 

Finally, we prove that the norm of some systems can be improved uniformly for all 
choices of coefficients by a rearrangement, for p > 2. 

Theorem 10. Let {0n}^=i be an ONS such that \\(f>n\\L°° < M for each n, and let p > 2. 
There exists a permutation tt : [N] — >■ [A''] such that the orthonormal system {4'-K{n)}n=i 
satisfies 

ll^[/]||L2(Vp)«Af,plnln(iV)||/|U2 (4) 

for all f = Y.n=l an4>n- 

The maximal V°° version of this result is due to Bourgain [T] and represents the best 
progress known towards Garsia and Kolniogorov's rearrangement conjectures. Our methods 
rely heavily on those developed in that paper. This also leads us to perhaps the most 
interesting open problem relating to operators: 

Question 11. Does there exist a permutation it : [N] — [N] such that the L^ norm of the 
associated operator on the trigonometric system grows like o{^J\n-{N)) ? 

Our Theorems [7] and [TU] may be viewed as evidence that this may in fact be possible. 
It is consistent with our knowledge that one could get growth as slow as ^J\w\x\{N) . It is 
known that purely probabilistic techniques in the maximal {V°°) case can only go as far as 
Bourgain's bound of Inln(iV) (see Remark 2 of ^.). Thus, finding a permutation that reduces 
the growth further (Garsia's conjecture is the assertion that there exists a rearrangement 
that gets to 0(1)) would require fundamentally new ideas. However, it is consistent with our 
current knowledge that the purely probabilistic techniques could get one down to Inln(iV) 
in the V"^ case. If true, this will certainly require a much more delicate analysis than the 
methods used here. Theorem [3] combined with the V°° case of the previous theorem does 
give a bound of yJ\n{N) Inln(A^) for general bounded ONS for the operator. This is a 
nontrivial improvement for some systems, but not the most interesting classical systems. 



2 Notation and General Remarks 

We will work with ONS defined on the unit interval T. The underlying space T plays almost 
no role in our proofs (the role is similar to that of a probability space in probability theory) , 
and one could replace it with an abstract probability space. 
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We assume that the ONS is real valued in most of our results. In these cases, one can 
obtain the same results for complex valued ONS by splitting into real and imaginary parts 
and applying the arguments to each. The details are routine so we omit them. The proof of 
Theorem [7] is the one place where this requires some care, and thus we work with complex 
valued functions directly there. 

We define the trigonometric system to be the system of complex exponentials {e^'^*"^}^^]^. 
Typically the trigonometric system is defined to be the doubly infinite system {e'^^^"'^}^=-ao 
and the maximal and variational operators are defined with respect to the symmetric partial 
sums. However, we find it more convenient to define the trigonometric system this way and 
avoid having to state all of the following results for both singly and doubly infinite systems. 
All of our results can easily be transferred to the doubly infinite setting (using symmetric 
partial sums) by splitting the Fourier series of a function f G (T) with respect to a doubly 
infinite system into two functions with singly infinite Fourier series and applying the results 
in this setting. For instance, note that 



Mfix) 



sup 

N 



N 

E 



an(l}n{x) 



«C sup 

N 





E 



an4>n{x) 



sup 

N 



N 

E 

n=l 



an4'nix) 



Thus it follows that the boundedness of the maximal operator associated to the system 
|g27rma;|oc^^ implies the boudedness of the symmetric maximal operator associated to 
|g27i-ma:|oc^^^^ Similarly for the operators. 

The Haar system, which we denote by {"HnlJ^g, is a complete ONS comprised of the 
following functions. For k E N and 1 < j < 2*^, we define {Hkj} by 




We form the system T-Ln by ordering the basis functions {"Hfcj } first by the parameter k and 
then by the parameter j, or Tin = T~ii.k for n — 2'' + j. Lastly, we set Hq = 1. 
The Rademacher system, denoted {r„(a;)}^;^, is defined by 

r„(x) — sign sin (2"7ra;) . 



The Rademacher system can also be thought of as independent random variables which take 
each of the values { — 1, 1} with probability 1/2. 



3 Variational Rademacher-Menshov-Type Results 

We start by giving a proof of Theorem [31 

It suffices to assume that is a power of 2, say N = 2^. For all i,k such that < i < £ 
and < fc < 2^^* — 1, we consider the collection of intervals Ik,i ■— (fc2% (fc + 1)2*]. 

Lemma 12. Any suhinterval of S C [0, 2^] can be expressed as the disjoint union of intervals 
of the form Ik^i, such as 

S^{jh^,^^ (5) 

m 

where at most two of the intervals Ikm,im union are of each size, and where the union 

consists of at most 21 intervals. 

Proof. Let S = [a, 6] and set i' := max/^.gs*- It follows that there are at most two 
intervals of the form Ik^i' contained in S (otherwise S would contain an interval of the 
form Ik^i'+i). Let r denote the right-most element of the interval with the largest k value 
satisfying Ik^i' C S. Now b ~ r has a unique binary expansion. It easily follows from this 
that (r, b] can be written as [r, b] — [J^ 1^^ where the union contains only one interval 
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of the form Ikm,im of particular size, and these intervals are disjoint. An analogous 
argument allows us to obtain a decomposition of this form also for [a,r'], where r' is the 
left-most element of an interval with the smallest k value satisfying I^^i' C S. The lemma 
follows by taking the union of these two decompositions. □ 

We now prove 
Lemma 13. In the notation above, we have that 



1/2 



(6) 



Proof. By rounding up to the nearest power of two, we can assume without loss of generality 
that N = 2^ for some positive integer £ (this change will only affect the constants absorbed 
by the <C notation). Now, for each a;, we have some disjoint intervals Ji, . . . , C [N] such 
that: 



l|S[/lllv"W = 



j=l \n£jj 



It is important to note that these intervals depend on x. 

By Lemma Il2l each Jj can be decomposed as a disjoint union of the form ([5]). In this 
disjoint union of intervals Ik^.i„^, each value of im appears at most twice. For each j and i, 
we let If denote the union of the (at most two) intervals in the decomposition of Jj which 
are of length 2*. We then have: 



Applying the triangle inequality for the P norm, this is: 



< 



E E 



Now, since each If is a union of at most two intervals, this implies: 



ii5[/]ik.(x)«E^ 



^ k=' 



E E """^"C^^) 



(7) 



\nelk 



Notice that we are now summing over all intervals 7^.^ for each i, regardless of the value of 

X. 

We take the norm of both sides of ([7]), and apply the triangle inequality to obtain: 



l|5[/]|U^(y2)«E 

1=0 

By linearity of the integral and Parseval's identity, we have that 



2«- 



A E E 

\ k=Q \n£lk,i 



(8) 



E E ""'^"(2^) 

fc=0 \nG/fc,i 



N 



= 1 E E =|E«?. 

k=0 nelk,i 



\7l=l 



L2 



6 



for each i. Combining this with ([5]) and noting that there are ^ In values of i, we have: 

/ oo \ 1/2 



\\S[f]\\L^V-)«HN)\Y.K\ 



□ 



We now define a variant of the function ||5'[/]||y2 (x) which we will denote by 1 1 5'l [/] 1 1 (x). 
For each x, we define S'l[/](x) to be the sequence of differences of lacunary partial sums 
of/atx, i.e. Si^[f]{x) -.^ {S-2o[f]ix), S-2i[fKx) - S^oifKx), S^^mix) - S^ilfKx), . . .}. As 
usual, we let ||S'l[/]||v'2(x) denote the 2- variation of this function. 

Lemma 14. In the notation above we have that 

\\SL[mL-(v-) « (EM" + i)i«nn • 

Proof. We will need the inequality |ap < 2|a — fep + 2|6p for any real numbers a, 6. For 
each X, there exists some sequence mo(a;), mi(x), m2(x), . . . such that: 

oo 
1=1 

Setting a := Sj^a-) [f]{x) - S^m,_i(:.) [f]ix) and b := /(x) - S'2™i_i(x) [f]ix), we can apply 
the inequality above to obtain: 

|52"^.w[/](^) - V.-i(^)[/K2^)r < 2 - /(a;)|V 2 I - /(a;)!' 

for each i > 1. Combining this with we have: 

oo 

II^L[/]||^.(:r) « IS^^o,., [f]{x)\' + J2 |52.".<=^) [/](^) - /(^)l' + I V.-i(^) [/] W - /(^)l' 

oo 

« |^2"o<.,[/](a:)r + ^ |52.".<=")[/](^) - fi^ f 

oo 

« |^2"0<., [f]ix)\' + ^ - f{x)\' . 

m=0 

Note that in this last quantity, we are always summing over all values of m, instead of 
summing over a subsequence dependent on x. 
This gives us 

||5L[/]||y.(x)« (|52„,,(.,[/](x)P+f;|52™[/](x)-/(x)|M . 

V m=0 / 

Now we take the norm of both sides of this inequality to obtain: 

(oo \ 2 

To see this, note that 15*2'" [/] (2:) ~ f{x)\ — |X]^2™+i '^n4>n{x)\ and each n is greater than 
2™ for ^ ln(n) values of to. The result then follows from Parseval's identity. 

□ 

We now combine these two results to prove the following theorem. 
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Theorem 15. For an arbitrary ONS, in the notation above, we have 



\S[f]\\mv^)^[Y,\n\n + l)al 



\n=l 



Proof. We write Uk{x) := X]ri=2'«-i+i '^"'^"(^) ('^hen fc = 0, Uq{x) := ai(f)i{x).). We claim 
that 

\\S[f]\\h^v.) « ^ M|^L[/]||^.(a:) + £ imU^)] dx. 



k=0 



To see this, note that any interval [a, b] can be decomposed as the disjoint union of at 
most three intervals Ii,Ic,Ir, where 4 = (2*^, 2*^'] and /; C (2'=-i,2'=] and Ir C (2'=', 2'='+i) 
(here, 2*^ can be set as the smallest integral power of 2 contained in [a,b], and 2*^ can 
be set as the largest integral power of 2 contained in [a, 6]). Now, /.j, ||S'L[/]||y2(a;)rfa; ^ 
^^j^ ln(n+l)|a„p from the previous lemma, which is clearly bounded by X^^^i lii^(?i+l)a^. 
By Lemma [T51 we have 



2" 2" 



I \\Uu\\l.{x)dx^\n\2'^ + l) E In2(n+l)a2. 

n=2'=-i + l n=2'=-i + l 

Combining these estimates completes the proof. □ 

Next we show that these estimates can be improved if one has additional information 
regarding the ONS. In particular, if the partial sum maximal operator A4 associated to the 
system is bounded then one can replace the In^(n) above with an ln(n). 

Theorem 16. Let f{x) = Yl,n=i '^n4>n{x) and assume that \ |A^/| II^ <C A(A^) (^Yl,n=i K 
for any choice of f . Then 

/ TV \ 1/2 

and 

1/2 



1/2 



In particular, if the quantity on the right is finite, then the variational operator applied to f 
must be finite almost everywhere. 

Proof. As before, without loss of generality, we may assume that N ~ 2^ for some positive 
integer £. And we consider the collection of dyadic subintervals of [1, A^] of the form Ik^i = 
{k2\ {k + 1)2'] for each 0<i<^, 0<fc< 2'''-' - 1. We wih refer to intervals of this form 
as admissible intervals. 

Now we note that an arbitrary interval J = [a, b] C [A^] can be written as a disjoint union 
J = J, U Jr, where Jr C 4^^.,^ and Ji C 4,^^, and |J/| > l\Iki,zi\ and \Jr\ > ^Ih^.iJ- We 
allow one of the intervals to be empty if needed, although in the following we will always 
assume that the intervals are not empty, since estimating the contribution from an empty 
interval is trivial. That is, we can write an arbitrary interval J as the union of two intervals 
which are contained within admissible intervals and the intersection with the admissible 
intervals is a constant fraction of the the admissible interval. 

For J C [Af], let Sj ^„ ^jan4>n{x). We now claim the pointwise inequality 

o<i<e o<fc<2«-'-i 
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Note that the sum on the right is only over all admissible intervals. To see that this 
inequality holds, let {Ji}™ x be a partition of [N] that maximizes the square variation 
(at x). From the discussion above, we can associate disjoint Jl and to Ji such that 
Ji C Jj' U J[. Moreover, we can find disjoint admissible intervals ll and If such that Jf C If 
and |Jf I > i|/f I {se{r,l}). 

We observe that |S',7,(a;)|2 < \MSji{x)\'^ + \MSir{x)\'^. Moreover, any particular ad- 
missible interval / will be associated to at most two intervals in the partition {Ji} since 
the intervals in the partition are disjoint and have at least half the length of the associated 
admissible interval. The pointwise inequality above now follows. Now integrating each side, 
applying the hypothesized inequality ||A^S'j||^2 <C ^^i^)J2nej^'rn noting that every 
point in [N] is in 0(In(iV)) admissible intervals, we have that 

\\f\\l.dx« E E / \MSi,_^ix)\^dx 

0<i<£ 0<fc<2«-'-l^ 
N 

n=l 

Taking the square root of each side completes the the proof of the first inequality in the 
theorem statement. The second statement follows from the first via the argument used to 
prove Theorem II 51 Note that we obtained a bound on the lacunary partial sums in Lemma 
[Til of the order ^y\n{n). This estimate was better than we needed for the proof of Theorem 
1151 however is exactly the order we need here. □ 

This completes the proof of Theorem |3] and Corollary |4] follows. 



/ 



4 Lower bounds 

In this section, we prove: 

Theorem 5. Let {0„(a;)} be a complete ONS. Then there exists a function f € L°°{T) such 
that for almost every x G T 

||/||v2(x) =oo. (10) 

f K \^/^ 

Here, as before, ||/||y2(a;) = sup^^ sup„j,< <„^. 

Etil^nJ/](^)-^n,-i[/](^)n where 

Sni[f]{x) = J2n=i 0'n(t>n{x) is the n/-th partial sum. 

Using Lemma [T7| below and properties of the Dirichlet kernel, Jones and Wang showed 
pop for the trigonometric system. In the case of general orthonormal systems, we do not 
have analytic information regarding the partial summation operator and need to proceed 
differently. We start by establishing the result for the Haar system. 

We let Ek ^ denote the conditional expectation operator defined as follows. For 
X e [12^'', {I + 1)2-'=), < Z < 2*=, Z G N we define 

Ml+l)2-'' 

Ekfix) ^ / f{x)dx. 
Jn-'' 

Using a probabilistic result of Qian [T2], Jones and Wang |^ showed that: 
Lemma 17. (Proposition 8.1 of JVji) There exists f € L°°{T) such that 

1/2 



sup sup i^\EnJ{x) - En,_J{x)\'^\ 
K no<...<nK J 



almost everywhere. 
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If we let Sn[f] denote the partial summation operator with respect to the Haar system, 
then it easily follows that Ekf{x) = Sn^^i [f]{x) — Sn^ [f]{x) for some sequence {n/j}. There- 
fore, there exists / G L°°(T) such that ||/||y2(a:) = oo for almost every x G T, where the 
operator V'^ is associated to the Haar system. For future use, let us define {bn} to be the 
Haar coefficients of the function /, that is 

bn^{fix),Hn{x)) . (11) 

We will also need a theorem of Olevskii (see [13] Chapter 3), which requires that we 
introduce some additional notation. Let and {/n} be two sequences of real-valued 

measurable functions on T. We say that they are weakly isomorphic if for each n G N there 
exists an invertible measure-preserving mapping T„ : T — > T that is one-to-one on a set of 
full measure and satisfies 



fk{Tnx) = gk{x) 



for all 1 < fc < 



Theorem 18. (Olevskii) Let {0n}^i be a complete real-valued orthonormal system. There 
exists an orthonormal system {Hk}'^i that is weakly isomorphic to the Haar system, and a 
sequence {".fc}fc^i such that 



"fc+i 

E 

i—nk-\-l 



< 2 



-k-j 



L2 



whenever j ^ k. 

We now set f{x) :— Yl^=i^nHn{x), for &„ defined in PT|) . Using the fact that the 
(finite) partial sums of the series defining f{x) are weakly isomorphic to the partial sums 
of the Haar expansion of /, it follows that the partial sums of the function / are uniformly 
bounded, hence / e L°°{T). 



Lemma 19. For f defined as above, we set Cn '■= (^f,(j)nj- It follows that 

^ c„0„(a;) = bkHk{x) + ek{x), 



n=nfc+l 



where J2k \^k{x)\ < oo for almost every x. 
Proof. Since f{x) = ^jHj{x), we have 



+ 1 n=nfc + l \j=l I 



= ^ bk{IIk{x),4>n{x)) 4>n{x) + ^ ( y^bjllj{x),4>n{x) j 4>n{x) . 

By applying the triangle inequality, we obtain: 



"fc+i 



bkHk(x) - ^ c„0„(a 



< \bk 



L2 



E {Hk{x),(t)n{x)) (t)n{x) 

n^[nfc + l,nfc+i] 



E {Hj{x),(j)n{x)) (l)n{x) 



L2 
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Now applying Theorem [THl we have that 



bkHkix 



Cn(f>nix) 

nfc + 1 



L2 \ j/fe j^fe 



The last bound follows from the fact that \bj\ < 



Denoting the expression on the inside of the norm on the left as ek{x), we sec that 



J2k=i \^k\\\]^2 II/IIl^ and hence J2k=i \^kix)\ is finite for almost every a; e T. 



□ 



We now prove Theorem [S] We let and Vh denote the variation operators associated 
to the systems and {Hn} respectively. Moreover, we let V'^ be the variation operator 

associated to the partial sums of the absolutely convergent function E(x) = '^'^Li Skix). 
We have, for almost every x gT, 



\E\\y2{x) < ^ \ekix)\ < oo. 



fc=i 



It follows that 



E 

k=l 



bkH, 



< 



\E\ 



Since the first quantity in this expression is infinite almost everywhere, and the third quantity 
is finite almost everywhere, it must hold that ||/||L2(y2-) is infinite almost everywhere. This 
completes the proof of the theorem. 

Our proof of Theorem [5] was purely qualitative, a feature we inherit from Theorem I18L 
which relies on the Riemann-Lebesgue lemma. Next we show that it is impossible to obtain 
a quantitative lower bound on the growth of the variation in Theorem [51 

Remark 20. One could obtain the conclusion of Theorem\^ for functions in more restrictive 
classes. Combining the above argument with known perturbation techniques, one can show 
that the f in the statement of the theorem can be taken to be continuous. The proof of this 
relies on the fact that one already has an example in L°° (an example in is not sufficient). 
See p. 67 and the associated references for details. Additionally, one can show that for 
any nonconstant function f , there exists an invertible measure preserving transformation of 
r : T — > T such that the conclusion holds for g{x) = f{T{x)). See US\/ p. 69 and the related 
references for details. From this, we see that one cannot hope to prove that is bounded 
on even in "restricted weak type" form, at least not for complete systems. Since the 
details of these arguments are not essential to our current investigation, and are essentially 
a combination of the above argument and the ideas of the cited papers, we omit them. 

Theorem 21. Let w{-) denote a positive real-valued function monotonically increasing to 
infinity. Then there exists a complete orthonormal system {(/iri}^! such that for all suffi- 
ciently large N Cz N, 

11/1^2(^2) « u;(iV) |^^^Ja„|2^ . 
for all f of the form f{x) = X^^Li (a;). 

Proof. Our example will be a rearrangement of the Haar system. We let ^ = {V'nCa;)}^! 
be a subsequence of the Haar system with disjoint supports. We let {pn{x)}'^^i denote the 
subsequence of the Haar system consisting of all the elements of the Haar system that are 
not included in VP. We now form a complete orthonormal system {4>n\ by sparsely inserting 
elements of the sequence {pn{x)}'^^i into the sequence {'>pn{x)}'^^n maintaining the relative 
ordering of each sequence. Clearly we may do this so that the first N elements of the system 
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{(p,i} have at most w{n) elements from the p's. We thus may partition the indices [N] of the 
system {<j)n}n^i into two classes. We let S be the subset of indices n for which 0„ = pm for 
some m and S"^ :— [N] \ S. We note that for n G 5"^, 0„ is an element of the subsequence 
5*, and so all of these have disjoint supports. 
We then have: 







< 




+ 




7ies 






Ties 







«ln(z«(n))||/|| 



L2 



,2 «ln(«;(n))||/|U2 « |/| 1^2 . 



Here, we have employed the triangle inequality, Lemma flSl and the fact that {^njneS'^ have 
disjoint supports. 

□ 

Lastly, we show that if a system is uniformly bounded, then an quantitative lower bound 
on the growth of the V'^ operator is available, even without assuming completeness. 

Theorem 6. Let {(f>n}n=i '^^ ONS uniformly bounded by M . Then there exists a function 
of the form f = X]n=i '^n4>n{x) such that 



\\s[f]\\mv^) »M VhMN)\\f\\L^ 

In light of Theorem ini this is best possible. 

To prove this, we will rely on the following lemma: 

Lemma 22. We let ci, . . . , cjv denote real numbers, all > d for some constant S > 0. We let 
Xi, . . . ,Xn denote independent Gaussian random variables, each with mean and variance 
1 . Then 



E 



N 



> S^^N In ln(7V). 



Proof. We essentially follow the proof of Theorem 2.1 in [TB] (pp. 1373-1375), with minor 
modifications. We let $(a;) denote the standard normal distribution function. By Lemma 
2.1 of jl6j (p. 1373), we have that 



1 - $(a;) > (l/12)ea;p(-3a;V4) for x > 1. 



(12) 



We define Sk = z2n=i ^^nXn and we set K :— 25. We also set 



i := £iN) 



InN 
4:\nK 



and m :— m{N) 



IniV 
2\tlK 



We let Lx :— max{l. In a;}. 

For each lo £ Q, (where 51 denotes the probability space), we define Em{(jj) to be the 
subset of values ie{l,2,...,A^ — \/N} such that, for some £ < j < rn, \St+Ki (w) — St{u})\ > 
Sy^ LL{N)/2. Additionally, for each fixed t and j, we define the event 

Ej^it) := {lo : 15^+^,(0.) - W-i(^)l > S^&LLiN)} . 

Now, St+Ki ~St^fcj-i is distributed as a Gaussian random variable with mean and variance 
equal to 

o-^ := Var[St+Kj - St+xi-^] = ^ c^. 

For any A G M, 

P [St+K. (c^) - St+K.-i > A] = 1 - $ ("^ 
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We apply this with A := Ki LL{N), and since each c„ > S, we have: 



KiLL{N) 



A 



Therefore, using (|T2)) . we obtain: 



V[Ej,it)] = I - <P [ ^ ] >l-$ 



IOLL{N) 



> — exp ( 
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4 K3 - KJ 



-LL{N) 



This is > ^exp{~^LL{N)) = j^(hi(iV))-4/5. 

We observe that if \St+Ki{^) ^ •S't+xi-i > Ki LL{N) for some t < j < m, 
then either IS-t+^^^w) - 5t(a;)| > 5^KiLL{N)/2 or ISj+k:--! - 'S'f| > 5^KoLL{N)/2 > 
6^K:i-^LL{N)/2. Thus, 

3=1+1 

Therefore, for any t € {1, 2, . . . ,N — [a/ZYJ }, we have: 



We note that for j' ^ j, E-'^{t) and E-'^{t) depend on disjoint sets of the random variables 



U ^Nit) 

j=e+i 



Xi, and so are independent events. Therefore, letting E''j^{t) denote the complement of 
El^{t), we have 



U ^Nit) 

j=e+i 



= 1 



= 1- n nE^Mit)]- 

■j=e.+i 



By the above computations, this is 

> l-exp(-(l/12)(m-£)(ln7V)-''/5) . 
For sufficiently large N , we can bound this by: 

>l-exp (-(lniV)i/V(521ni\:)) := l-pN- 



This shows that for each t, P [w : i G i?jv(i^)] > 1 — Pn- We can alternately express this 



as: 



lE^it)dP>l-pN, 



where lEjv(f) denotes the function that is equal to 1 when t € En{uj) and equal to 
otherwise. We define the subset 5 C to be the set of w e such that |i?Ar(w)| > 
il-^)iN -VN). Then 

P[5] > 1 - (13) 

To see this, observe that 



n-Vn 



/ ^EAt)dF^ E / ^EAt)dF> iN-VN){l-pN). 

Now, if ¥[S] < 1 — y/pN held, this would imply that the integral on the left hand side of the 
above is also 

<Vmii- Vm) [n - Vtv) + (i - ^) (n - Viv) = [n - Viv) {i^pn) , 

which is a contradiction. 

We next use the following Vitali covering lemma: 
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Lemma 23. (JS^, Lemma 3.15) Let ^{A) denote the Lebesgue measure of a set ACM. Let 
lA he a collection of open intervals in M with hounded union W . Then for any A < ^{W), 
there is a finite, disjoint suhcollection {Vi, V2, . . • , Vq\ C lA such that ^ A/3. 



For sufficiently large iV, (jl3p implies that with probability > 1 — ^PN: for > N' := 
[ ( 1 - {N- \/N- 1 ) J integers t £ {1,2, . . . ,N~ \/N } (we will call them 1 1 , i 2 , • ■ • , i JV' ) : we 
have corresponding values ji jAT' (all < m) such that \S^._^xii ~ Sti\ > S^J Ki'LL{N)/2 
for each i from 1 to N' . We consider the collection U of the open intervals {ti , ti + if^' ) 
for i from 1 to N' . We note that each K^^ > 1. We fix some positive constant a < 1. 
For N sufficiently large, we have N' > aN. (Note that pN approaches as goes to 
infinity). Therefore, the union of the intervals in U is a subset of (0, A^] with Lebesgue 
measure > N' > aN. 

Applying Lemma I23[ we conclude that there is disjoint suhcollection of these open in- 
tervals, denoted by {{ti,ti + K^')}i^Q, where Q C [A^'], such that 



K^^ > aN/3. 



The closures of the intervals in Q are non-overlapping except for possibly at their endpoints. 
Relabeling the ti's for i S Q as ti, . . . ,tq (where q = \Q\), we have ti < ti + K^^ < ^2 < 



t2+K^^ <■■■ <tq<tq 



g 



This implies that 



K^" < N. Then, 



SuY > {l/4)S^^K^^LL{N) > {a/12)S^NLL{N). 



> (5V(a/12)AflnlnA^ 



y2 



> 1 - 



/PN, 



for all sufficiently large A^. Hence, by Markov's inequality. 



E 



N 
n=l 



> (5v/(a/12)A^ln lnAr(l - y/^) > SVN InlnN. 



□ 



We now prove Theorem [51 We begin by noting that for each n, J^(f)^{x)dx — 1 and 
|0n(a;)| < M Va; implies that there are positive constants e,(5 > (depending on M) such 
that for some sets C/„ C T each of measure > e, |0„(x)| > 5 for all x £ [/„. For each n, we 
let Xn denote the characteristic function of the set J7„. We then have: 



„ N ^ r 



Xn{x)dx > Ne. 



(14) 



We define e' :— |. Then the function Xn{x) must be > e'A^ on a set of measure > e'. 



To see this, note that < J2n=i Xn{x) < A^ for all A^. If this function is less than e'A^ on a 
set of measure > 1 — e', this would imply 



N 

E 



Xn{x)dx < e'N{l - e') + e'N = (1 - e/4)Afe, 



contradicting Thus, there is some set U of measure > e' such that for every x £ U, 

\4>n{x)\ > S for at least e'A^ values of n. 
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We let Xi , . . . , X]\f denote independent Gaussian random variables with mean and 
variance 1. We consider the quantity 



E 



This can be written as: 



E 



\\{Xn<l>n{x)} 



N 



dx 



[ [ ||{X„(/.„(x)}^^i| 
Jn Jt 



V2 



dxc 



By Fubini's theorem, we may exchange the integrals to obtain 

= / / \ \{XMx)}'^^^\\l^dFdx. 
JT Jn 



Since the inner integral is a non-negative quantity, this is 



> 



{X^Mx)}n=l 



11/2 



dx. 



We consider a fixed x E U. By definition of U, we have > 6 for at least e'N values 

of n. We now define new independent Gaussian random variables Yi, . . . , for > e'A^ 
as follows. We start from n — 1, and we define Yi to be the first partial sum J2n=i 4'n{x)Xn 
such that J2n=i l<^n(^)l — ^- We then similarly define Y2 to be X^nlm+i 4'n{x)Xn for the 
smallest n2 such that J2n=ni+i \'Pn{x)\ > S. We continue this process, defining the F^'s to 
be disjoint sums of the (f>n{x)Xn's. Since x G U, we will have Yi, . . . , Y^ for N > e'N. Since 
the sum of independent Gaussians is distributed as a Gaussian (with variance equal to the 
sum of the variances), each Yi is distributed as an independent, mean zero Gaussian with 
variance > S^. Thus, applying Lemma [HJ we have for each x G U: 



E 



{x,,Mx)}^=i 



Iy2 



> 



> (52iVlnln(7V) > 5^Nln\n{N). 



Therefore, we have 



E 



{XnMx)}n=i\\L2^y2i\:S> / d^N\nln{N)dx:> NlnlnN. 



(15) 



We note that the constants being subsumed by the » notation above depend on M. 

Now, we consider the contribution to this expectation from points w in the probability 



space ri such that J2n=i ^ni^)"^ is much larger than A^. We will show this contribution is 



AT 



X^ > kN 



for each positive 



small. To do this, we will upper bound the quantity 
integer k > 2. We rely on the following version of the Berry-Esseen theorem. 

Lemma 24. (\14^ , p. 132) Let Zi, . . . , Zn be independent, mean zero random variables 
with E[|Z„|2+')'] < 00 for all n for some < 7 < 1. Let al := ¥.[Zl] and Bn := Y.n=i ^n- 
Then, for all x G M; 



B 



N 



N 

E 



Zn < X 



$(j;) 



< 



A 



N 



i?^+^/^(i + N)2+7,„-rt 



|2+71 



where A is a constant and $(a;) denotes the standard normal distribution function. 

Now, letting Xi, . . . , Xj^ denote the independent, mean zero, variance one Gaussians as 
above, wc define Zi, . . . , Zj^ by Z„ :— X^ — 1. Then the Z„'s are independent, mean zero 
random variables. We note that E[Z^] = E[X,'J] — 1 = 2 for each n. Also, 

E[|Z„|3] = E[|X6 - 3X^ + iXl - 1|] < E[X^] + 'M[Xi] + 2,^X1] + 1 = 28. 
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We will apply Lemma [Ml for Zi, . . . , Zjv, with 7 := 1 and Bn = 2N (since cr^ — 2 for each 
n). We observe: 



N 



Y.Xl>kN 



N 



N 



= 1 



N 



< 1 - ^{x) + 



A 



N 



where x := 2-^/^{k - 1)N^^^. 

Since E [l^nP] is a constant, this is 



7V1/2(1 + |2;|)3- 



Using that a; = 2-'^/^{k - l)N^^^, we have 

1 



A/-l/2(l + |a;|)3 iV2(fc-l)3' 

Since a; > 1 (recall that k > 2), we have 



e ^ dy < ye ^ dy ^ e 
Combining and ([TT]). we see that 



AT 



iV2(fc- 1)3 



(16) 



(17) 



for each positive integer k > 2. 

Now, by Lemma [T51 for each e f2 such that kN < ^nl'^) < (fc + 1)-^, we have 

that the quantity | |{-'fn'/>n(a;)}^^i| |^2(y2) evaluated at cj is <C (fc + l)ln^(A^)A^. Thus, the 
contribution to the expectation bounded in (1151) coming from such points lo for all fc > 2 is 
upper bounded as: 



<^(fc + l)ln2(7V)7V 



k-2 



g-iW(fe-l) 



1 



ln2(7V)7Ve~3~^(fc + 1) 



k=2 



e 4 



7V2(fc-l)3^ 

j_^sfc^-2fe ln2(iV)^ k + 1 



^ (fc - 1) 



iv ^,(fc-i)^ 



Both of these sums are convergent, and it is easy to see that this quantity is o(A^lnln A^). 

Therefore, by (jlSp and the above bounds, we have proven that there exists some point 
u! n such that when we define a„ := Xn{uj) and define f{x) = X]^=i o,n4>n{x), we have 



'^n=i '^n — 2^ simultaneously. 



\\S[f]\\mv-)->M 0nln(iV)||/|U.. 



Here, we have used that we can choose ui so that ||'S'[/]||2 21-^2-) A^lnln(Af) and 11/11^2 



5 Systems of Bounded Independent Random Variables 



In this section, we prove the following theorem: 
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Theorem 9. Let {Xi}^^ be a sequence of mean zero independent random variables such 
that \Xi\<C and E [\X,\^] = 1 for all i E [N]. Then 



E [\\{a..X.,}l,\\^,] «c V'kMN) (E«^') 



1/2 



We will require the following lemmas. The first is a form of HoefFding's inequality [S]. 
Lemma 25. Let {Xi} be independent random variables such that ¥[Xi G [oi, hi]] ~ 1. Then 

2t^ 



P[|5„-E[5„]|>i]<2cxp 
where Sn = Z]"=i ^i- 

Lemma 26. (Etemadi's Inequality). (See Theorem 1 in 14-!-) Let Xi, X2, ■ ■ ■ ■, Xn denote 
independent random variables and let a > 0. Let Si :— Xi + ■ ■ ■ + X^ denote the partial sum. 
Then 

P[ max \Si\ > 3a] < 3 max V[\Si\ > a]. 

l<£<n l<£<n 

Lemma 27. (Rosenthal's Inequality) . (See Theorem 3 in JTS^-) Let 2 < p < 00. Then there 
exists a constant Kp depending only on p, so that if Xi , . . . , Xn are independent random 
variables with E[Xi] — for all i and ¥.[\Xi\P] < 00 for all i, then: 




(E[|5„n)^/^<X, maxi ij2E[\X,\^]] , X^E[|X 



We also use the following consequence of Doob's inequality. For an interval / C [n], we 
define Si := X^ie/^i- We also define 



Sn '■= max \Si\ 

/Cfnl 



We then have: 



Lemma 28. For p > 1 and independent random variables Xi, . . . , Xn with E[Xi\ = for 
all i, 

9 p-\ 



E 



< 2n 



max 

KKn 



<2P {-^\[\Sn\^] 



Proof. The first inequality is a consequence of the following observation. For a subinterval 
/ C [n], we let /q be the subinterval that starts at 1 and ends just before /, and we let Ji 
be the interval /q U /. Then /q and Ii are both intervals starting at 1, and Si^ + Sj = 5/^. 
Therefore, max{|S'/(,|, |S'/J} > ^IS*/!. The second inequality follows from Theorem 3.4 on p. 
317 in [3]. □ 

We begin by decomposing [N] into a family of subintervals according to a concept of 
mass defined with respect to the values. We define the mass of a subinterval / C [N] 
as M{I) :— J2nei'^n- normalization, we may assume that M{[N]) = 1. We define 
Io,i '■— W] and we iteratively define Ik^s, for 1 < s < 2'^, as follows. Assuming we have 
already defined Ik-i,s for all 1 < s < 2'^"^, we will define Ik,2s-i and Ik,2s, which are 
subintervals of Ik-i,s- Ik,2s-i begins at the left endpoint of Ik~i,s and extends to the right 
as far as possible while covering strictly less than half the mass of Ik-i.s, while Ik,2s ends 
at the right endpoint of Ik-i,s and extends to the left as far as possible while covering at 
most half the mass of Ik-i.s- More formally, we define Ik.2s-i as the maximal subinterval of 
Ik-i,s which contains the left endpoint of Ik-i.s and satisfies M{Ik^2s~i) < ■^M{Ik,s)- We 
also define Ik.2s as the maximal subinterval of Ik~i.s which contains the right endpoint of 
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Ik-i.s and satisfies M{Ik.2s) < ■^M{Ik^s)- We note that these subintervals are disjoint. We 
may express Ik-i,s = 4,2s-i U h,2s U where ik,s G 4-i,s- In other words, ik^s denotes 
the single element which lies between Ik,2s-i and Ik,2s (note that such a point always exists 
because we have required that Ik.2s-i contains strictly less than half of the mass of the 
interval). Here it is acceptable, and in many instances necessary, for some choices of the 
intervals in this decomposition to be empty. By construction we have that 

M{h,s) < 2-\ (18) 

We call an interval J C [N] admissible if it is an element of the decomposition given 
above. We denote the collection of admissible intervals by A. We additionally refer to 
the subset {/fc,s|l < s < 2*"'} of A as the admissible intervals on level k and the subset 
{*fc,s|l < s < 2^} as the admissible points on level k. We note that every point in \N] is an 
admissible point on some level. (Eventually, we have subdivided all intervals down to being 
single elements.) 

We consider an arbitrary interval J C [N]. We would like to approximate J by an 
admissible interval J such that J C J and M{J) < cM{J), for some constant c. This 
may be impossible, however, since J could span the boundary between adjacent admissible 
intervals for all comparable masses. To address this, we will instead approximate J by the 
union of two admissible intervals and one point. 

Lemma 29. For every J C [N], (J ^ %) there exist Ji,Jr £ A and ij £ [N] such that 
J :— JeU ij U Jr is an interval (i.e. Ji, ij, Jf are adjacent), J Q J , and M{J) < 2M{J). 

Proof. We consider the minimal value k such that J contains an admissible point on level k. 
We note that this point is unique, and we define ij to be equal to it. To see why a unique 
such point exists, first note that if J contained at least two admissible points on level fc, then 
it would also contain an admissible point between them on level fc — 1. Now we consider 
the subinterval Je consisting of elements of J that lie to the left of ij . Since the rightmost 
endpoint of this subinterval is at rightmost endpoint of an admissible interval on level fc, it 
is also a rightmost endpoint of some admissible interval on every level > k. We define Je to 
be the admissible interval with this right endpoint on the highest level ki such that Je Q Je- 
We note that the admissible interval with this right endpoint on level fc contains J, so such 
an interval Je must exist, and kg > k. 

We claim that M{Je) < 2M{Je). To prove this, we consider the admissible interval J' on 
level kg + I with this same right endpoint. By maximality of fc^, wc must have that J ^ J' . 
This implies that J must contain the admissible point on level ke + 1 that occurs when Je 
is decomposed. Therefore, M{Je) > ^M{Je). 

We define the subinterval Jr consisting of elements of J that lie to the right of ij , and 
we can similarly find an admissible Jr such that Jr C and M{Jr) < 2M{Jr). We then 
have J 'Z J := JeUijU .J- and M{J) < 2M{J) follows from: 

M( J) = M{Je) + M{ij) + M{Jr) < 2{M{Je) + M{ij) + M(J,.)) = 2M{J). 

□ 

Defining Je, Jr, and ij with respect to J as in the lemma, we observe that: 

\Sj\'<^\Sjf + \Sjf + \S.,\\ (19) 

Here, \Sj\ is the maximal partial sum over all subintervals contained in J. Also, if V is 
a partition of [N], then the admissible intervals and points {Je, Jr, and ij) associated to 
an element J of the partition will only reoccur for a bounded number of elements of the 
partition (i.e. a particular admissible interval/point will only appear among Je,Jr,ij for a 
constant number of J £V). This is because the J's in V are disjoint, so ij £ J for only one 
J £V, and M( J n Je) > \ Je implies Je can appear for at most two J's in V. 

Now we will prove Theorem [HI We let $7 denote the probability space for Xi, . . . ,Xm 
(each cj in is associated to a sequence of N real numbers). For each a; € £7, we let V^i 



18 



denote a maximizing partition. We define V^j.^ (resp. V^j^r) to be the set of Jp (resp. Jr) 
associated to J e V^i- We note that the same interval could appear as Ji or Jr for up to 
two different J's in Vui- 

We fix a large constant B which will be specified later. Now we split each set 'Pu:,sido 
(here side e {I, r}) into two disjoint subsets V^°^^ and V]^'^^^^^. We define T"^°^^ to be the 
set of J e T^w.sidc such that 



< BM{J) Inln(iV). 



(20) 



We then define V^'^^^^ to be the complement of V^"^^ inside 7^^,81(10 
Our objective is to prove the estimate 

E Yl l^-^l' <lnln(^). 
Using we upper bound the left side as follows: 



E 






E 


+ E 


E 








Jg.pgood 







E 


+ E 


E 


+ E 


E i^-i' 


Jgpbad 











E i^-i' 


<E 


E KXn\^ 


< 1 






Tie[Af] 





We observe that 'Zjev'^''^ \^ ^ (SjeP^ „dc ^^('^)) Inln(A^) < lnln(7V). This holds 
because X^je-p M{J) = 1, and the total mass of the intervals Ji, Jr,ij used to cover each J 
is at most 2M( J), thus X^Jgp ^ M{J) < 2. This shows that the terms involving the good 
admissible intervals are easily controlled. The last term is also easily controlled as follows 



E 



It remains to control the terms involving the bad admissible intervals. The argument 
is essentially the same for both the sums over V^'^f^ and V^^^^^ so we will work with the 

quantity E J^Jev^'"^ I'^'jP in what follows. 

We now partition V^^^f^^^ into two disjoint sets V^^^^'J^^ and V^^^^'^^. The set 'P^^J^de consists 
of intervals Ik,s & 'P^^sMo such that \Ik,s\ < 2~'^/^iV and V^^^^^'^^ contains the complement 
set. For each k, we define Tk C {Ik,s '■ ^ < s < 2''} as the collection of all intervals Ik,s 
satisfying \Ik^s \ > 2-''/'^N. Clearly, \Tk\ < 2*^/^ foj. g^^h k. We then have: 



E 1^. 



< E 



E E i^ii' 



oo 

EE^i 



Using ^ and the fact that E 



OO 

EE^[i 



'==1 Jen 

< E [IS'jp] (by Lemma (Ml), we have 

OO 

« E E ^ « E « 1. 



\Sj\' 



^==1 Jen '^'=1 Jen 

It now suffices to bound the more difficult term E 



fc=i 



^^^gpbad 1 \Sj\' 



19 



Now \Ik^s\ < 2'''/'^N if Ik^s e ^^!!idc- For a fixed interval J, we let B{J) C fl denote 

_ 2 

the event that the \Sj{uj)\^ is bad. In other words, uj e B{J) if S'j(a;) > BM{ J) lnln(7V). 
We let denote the complement of Tk- We now have that 



E 



E 1^. 



21n(Ar) 

^==1 JeT; 



Here we have restricted the the summation of k to the range 1 < fc < 2 In(A^) using the fact 
that 1 < |/fc,s| < 2-'=/2iV implies k < 21n(iV). 

We let 7 > denote a positive value to be specified later. Letting 2p := 2 + 7 and 
applying Lemma [57] (Rosenthal's inequality) we have that 



(E O^jp^']) 









/ 






2+7" 










E 

V 








) 



<C max < 



(21) 



\neJ 



The last inequality follows from the fact that the i"^ norm is greater than the i'^^'' norm 
andE[|X,|2+T] < C2+^. 

We let s :— |J|, and we let Sj g denote the sum of UiXi for the first £ indices i in J. By 

definition of the event B{J), we have: 

2 ^ ' ' 



E 



By Lemma this is 



> BM{J) Inln(iV) 



<C max 1 

l<i<s 



S 



J J 



< 



B 



max 

KKs 



^ B ~ 
> — M(J)lnln(iV) 



> — M(J)lnln(7V) 



By Lemma this is: 
< exp 



BAI{J) In ln(iV)\ 
3C2Af(J) ] 



exp 



Sin ln(7V)\ 



3C2 



By setting the value of B to be sufficiently large with respect to the constant C (i.e. B > 
12C^), we have: 



E 



BiJ) 



(22) 



that 



We now define g as a function of p so that i + i = 1, i.e. q = We then set 7 such 



(23) 



for all J. (Recall that p :— ^-j^.) We now apply Holder's inequality with p and q to obtain: 



21n(Af) 



E E^[ 



1 ~ 92 



21n(Af) 

^ E E(^ 



fc=l jgT,- 

Using (PT|) . and Lemma BHl we see this is 



2p 



2ln{N) 2ln{N) 

« E E ln-^(A^)M(J) « Y: In-^(A^) « j-^. 



This completes the proof. 
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6 Random Permutations 



In this section, we will use probabilistic techniques to prove the following theorem: 

Theorem 7. Let {(pn}n=i ^'^ orthonormal system such that \(l)n{x)\ = 1 for all n and 
all a; G T, and {an}n=i O' choice of (complex) coefficients. Then there exists a permutation 
TT :[N]^ [N] such that 



N 



1/2 



|{«7r(n)0w(n)}n=l 



Proof. We assume without loss of generality that 
exists some non- negative integer j such that 2~^~^ < \a, 
let Aj denote the set of n € [A''] such that 2~^~^ < |o„p 
^* Ujl|'2inAf] ^i- ^1^0 define 



= 1. Then, for each a„, there 
< 2~^. For each fixed j, we 
< We define A* C [AT] as 



We then observe, for any permutation tt : 

\\{K{n)(t>7v{n) {^)}n=l\\v2 



an, n€ A* 
0, n ^ A*. 

[N] [TV] and any xeT, 
1 



JV 
n=l 



AT 



Ar< 1. 



Applying the triangle inequality for the 1 1 • 1 1 y2 norm, this allows us to ignore the contribution 
of all terms a„ where n & A*. 

We consider the class of permutations tt : [A^] — >• [A^] such that 7r~^(Aj) is an interval 
for each j. In other words, these are permutations which group the elements of each Aj 
together. We allow arbitrary orderings within each group and an arbitrary ordering of the 
groups. For a fixed permutation tt, we let Bj denote the preimage of Aj under tt (so Bj is an 
interval). We will refer to the intervals Bj as "blocks". From this point onward, we will only 
consider permutations belonging to this class, and we will only consider the contribution 
of terms for Ai up to ^[2in(Af)j- We let A''' := \Ai\ + ••• + |^[2in(Af)j I- For notational 

convenience, we assume that tt maps [A'''] bijectively to Ui=i"^^^'' ^i- (This is without loss 

of generality, since we have seen that we can treat the set A* separately.) 

For each fixed permutation tt : [A^] — >■ [A^] in this class and each fixed a; S T, we consider 
the quantity 

lev 

where V denotes the maximizing partition of [N']. 

, 2 

We now define two additional operators, and Vg. The value of {a7r(n)<?^7r(n) {x)}n=i 
is defined as 

2 



aTr(n)4>Tr(n) {x) 



(24) 



{«7r(n)07r(n) {x)}n 



N' 



E 

leVL 



'^aTT(n)(f>-K(n) [x) 



where T-"/, is the maximizing partition among the subset of partitions of [A^'] that use only 



intervals which are unions of the Bj's. 



The value of 



is defined as 



{aT,{n)(t>T^(n){x)}n=l 



E 

leVs 



nel 



a,r(n)'^7i-(n) {x) 
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where Vs is the maximizing partition among the subset of partitions of [N'] that use only 
intervals / that are contained in some Bj. This can be alternatively described as taking 
that maximizing partition of each Bj and then taking a union of these to form Vs- 
We now claim: 



{aTr(n)'l>TT(n){x)) 



N' 



N' 



{a7r(n)07r(n)(a;)} 



N' 



(25) 



To see this, consider the maximizing partition V in ([M)) . Each I E V can be expressed as 
the union of three disjoint intervals, Is^, II, and Is^, where Is^ and Is^ are each contained 
in some Bi, and II is a union of BiS. More precisely, II is the union of all the intervals Bj 
that are contained in J, /g^ goes from the left endpoint of / until the left endpoint of II, 
and Is^ goes from the right endpoint of until the right endpoint of /. By construction, 
each of Is,, and Is^ is contained in some Bj. (Some of II, Is^i ^Si rnay be empty.) Thus, 



aTr(n)<t>TT(n){x) 

nelL 



7 , aTT{n)(t>Tr{n)ix) 



{x) 



Now, if we consider the set of intervals II corresponding to I G V, we get a disjoint set 
of intervals that can occur as part of a partition considered by the operator V^. Similarly, 
if we consider the set of intervals Isnls^ corresponding to I G V, we get a disjoint set of 
intervals that can occur as part of a partition considered by the operator Vg. Therefore, 



1. 

lev 



nel 



nel 



E 

leVs 



nel 



The inequality ((25|) then follows. 

We first bound the contribution of the operator. For each Bj, we define the function 
fj-.T^C as: 

(26) 



/j(^) ■= E (^^{n)'t>iT{n){x) 
neBj 



Since the sets Bj are disjoint, we note that the functions fj are orthogonal to each other, 
but they may not be uniformly bounded. We need to show that there exists a permutation 
a : [[21n(iV)J] ^ [[21n(iV)J] of the Jj values such that 

1/2 




< Vlnln(iV) 



(27) 



This would imply that there is some ordering of the blocks for which the contribution of the 
Vl operator is suitably bounded. 

To show (|27p . we will use the following inequality of Garsia for real numbers: 



Lemma 30. (See Theorem 3.6.15 in [Sj.) Let xi,. 
permutation ip of [M] uniformly at random. Then: 



, xm G IR- We consider choosing a 



E 



max {xMi) 
i<k<M ^ ' 




M 



We derive the following corollary: 



Corollary 31. Let xi, . . . ,xm G IR- Let L be a positive integer, I < L < M. Let V denote 
the partition of [M] into intervals of size L (starting with [L]), except that the last interval 
may be of smaller size (when L does not divide M). We consider choosing a permutation 
of [M] uniformly at random. Then: 



E 



Eniax 
I'CI 

lev " 



E ^^u) 




\S\=L 



E-? 
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We note here that S ranges over all subsets of [AI] of size L. 



Proof. By linearity of expectation, we first observe: 

EE 



lev - \jei' / lev - \jei' / 



This quantity is then 



M 
< — E 
ij 



max 
rei 



Kiel' 



■4>{j) 



where / is any fixed interval of size L (without loss of generality, we may take / to be [L]). 

For any subset S C [M] of size L, the probability that V' ruaps / to 5 is (^^) . Condi- 
tioned on this event, the action of "0 on / acts as random permutation of the values Xj for 
j € S. Applying Lemma [501 we then have the expectation (still conditioned on ip mapping 

/ to S) is <C (j2j^s ^i) + ^jes ^j- (Note that the maximum over all subintervals /' of I 
is bounded by a constant times the maximum over subintervals starting at the left endpoint 
of /, as in the lemma.) Thus, 



E 



max I J2 '^i'U) 
jer 



SC[M] 

\S\=L 



jes 



Since f(-) 



the corollary follows. 



□ 



We now decompose [[21n(iV)J] into a family of dyadic intervals. More precisely, we 
consider all dyadic intervals of the form 

((c- l)2^c2^], ^ e {0,1,..., [ln(21niV)]}, cG . . . ^ 2ri"'"W+i"2l-f | 

(Some of these intervals may go beyond M := [21n(A^)J. For these, we consider their 
intersection with [M].) The exponent £ of an interval here defines its "level". In other 
words, we say an interval ((c— 1)2^, c2^] is on level £. We let denote the set of all intervals 
of this form. 

We then have that for any interval /' C [M], there are (at most) two adjacent intervals 
Ii,Ir € such that /' C IiU Ir, and |// U Xr| < 4|/'| (when only one interval is needed, 
one of /;,/,■ can be substituted by 0). To see this, consider the smallest positive integer k 
such that |/'| < 2*^. Then either /' is contained in some dyadic interval of length 2*^, or it 
contains exactly one right endpoint of such an interval. We then take /; to the be interval 
on level k with this right endpoint, and take Ir to be the next interval (with this as its open 
left endpoint). 

This implies the following upper bound for each permutation a and each x gT: 



{fa{j){x)}j^l 



L21n(JV)J 



^ > max 

^ I'CI 



(28) 



This holds because for each interval J in the maximizing partition, J C_ Ii (J Ir for some 
Ir,Ii G with |/| < 4|// U Ir\- Each I E T will correspond to at most a constant number 
of J's (it can only be J; for one J when Ir is non-empty, Ir for one J when // is non-empty, 
and it can contain at most 3 corresponding J's), and this constant factor is absorbed by the 
<^ notation. 
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We consider choosing cr uniformly at random. We observe by Fubini's theorem: 



E 



dx 



= / E 



{/a(j)(a;)}_,=i 



L21n(JV)J 



dx. 



Using the triangle inequality for the 1 1 ■ 1 1 y2 norm and linearity of expectation, we can split 
each fj{x) into real and imaginary parts, fj{x) — fj{x) + ifj{x), where /J and /* are both 
real valued. We then have: 



< / E 



[2 ln(Ar)J 
1 



dx 



L2 ln(JV)J 



dx. 



For each £ from to [ln(21niV)] , we let J^i denote the intervals in on level £. On each 
level, these intervals are disjoint. Applying (j28l) to the quantity above for (the argument 
for P is identical), we can express the result as: 



E 



L21n(Af)J 



By linearity of expectation, this is: 



rin(21n JV)] 

E : 



rin(21nAf)] 






2' 




E 


> max 

^ I'CI 


E/^w(-) 




dx 


1=0 















2" 


> max 

^ I'CI 


E/.^o)(-) 




leJ^e ~ 


jei' 





dx. 



Now, for each £, we apply Corollary [31] to the dyadic intervals on level £. As a result, we 
see that the above quantity is 



« E 



2^-1 



E 



SC[L21„(JV)J] 



j (e dx+Y^j r^ixfdx 



Combining this with the same result for the imaginary parts, we have: 



(29) 



2^-1 



E 



SC[L21n(N)J] 

|S|=2* 



/(^E/;(^)j +(^^^^^)) dx + Y,Jj]{xf+f]{xfdx 



(30) 



We consider the quantity 

2 



When j ^ /, 



/[(x)/[,(x) + /i(x)/Ma;)dx = 0, 



since fj and /j' are orthogonal, and this is the real part of J,j, fj{x)fji{x)dx. Thus, 



E/;(-) 



J2fj{x)] « ^ / /;(x)2 + /i(x)2dx. 
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We then have: 



E 



L21n(W)J 



dx 



y2 



E 

^=0 



sX'^'< 



SC(L21„(N)J] jGS' 

|S|=2* 



By Parseval's identity, Jj \fj{x)\'^dx = J^neAj hnP- Since each j occurs in exactly 
sets of size 2^ for each £, the above quantity is: 



x)\ dx. 



421n(7V)J-n 
^ 2'--l ) 



N 



< Inln(iV) ^ |a„ 



This imphes that there exists some permutation a such that 



AT 



{fa{]){x)}jJi 



L21n(Af)J 



dx < Inhi(Ar) ^ |a„| 



Taking a square root of both sides of this estabhshes ((TT)) . as desired. This concludes our 
analysis of the operator. 

We now bound the contribution of the Vg operator. 

Lemma 32. For some tt in our class of permutations, 



N 



{a-,T{n)4>^{n)ix)} 



N' 



dx < Inln(iV) ^ |a„ 



Proof. We first observe that it suffices to prove the following inequality for each Aj. We 
let Hj denote the set of permutations of Aj, i.e. each tTj £ Ilj is a bijective map from 
[\Aj\] — >■ Aj. We consider choosing such a permutation uniformly at random. Then if we 
have 

2 



E 



<lnln(7V) 



{a^,{n)4'-^,(n){x)}^n^[ dx 
V ^ 

for each ], this means that there exists a permutation tTj of each Aj satisfying 

, , 2 

{a^,(n)'/'7r,(«)(a;)}. 



(31) 



x\A,\ 



dx < lnln(7V) ^ |a„| 



and these permutations can be put together to form a permutation tt as required for Lemma 
[3H We note that it does not matter how we concatenate the tTj 's: by definition of the Vg 
operator, it only matters how each Aj is permutated, not the order the ^j's are placed in. 

We now fix a j and we will prove (j3ip . By Fubini's theorem, we can interchange the 
order of the integral and the expectation and instead work with the quantity 



E 



{a7r,(«)07r,(«)(a;)}Ll'i 



V2 



dx. 



For each fixed x, we define the set of complex numbers C to be the set of values an<f>nix') 
for n E Aj. Then, these complex numbers c S C all satisfy 2~-'~^ < |cp < 2~^ (recall that 
|0„(a::)| = 1). We let Nj :— \Aj\, and we let random variables Zi, . . . ,Z]^. denote random 
samples from C taken without replacement. We then see that it suffices to show: 



E 



{Zn}nLl 



<lnln(7V)^ 



cec 



cec 



(32) 



To show this, we will need the following lemma: 
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Lemma 33. Let Xi, . . . ,Xn. denote uniformly random samples from C with replacement. 
For each k from 1 to Nj, we let Sk '■= X]i=i^i- ^'^'^ ^ subinterval I C [Nj], we let 
Si := J2i<£i -^i- Then for any k and any p > 2: 



E 



inax|5'/-E[5'/]P 

7C[fc]' 



where C is a positive constant. 

Proof. We rely on Hoeffding's inequality [S], which implies that 



max|i?e[S'/] -E[i?e[S'7]]| > t 

IC[k] 



^ exp 



-ct' 
k2-3 



(33) 



for some positive constant c, where Re[Si] denotes the real part of Si. (More precisely, 
Hoeffding's inequality is applied with the maximum over Sm for \ < m < k. However, 
moving to a maximum over arbitrary subintervals only results in a change of the constant 
c.) The same holds analogously for the imaginary part of Si. 
We note that 



E 



max|i?e[S'/] -E[i?e[S'/]]|P 

I<Z[k] 



^p fp-H 



maxlEeiS*/] -E[i?e[S'/]]| > t 



dt. (34) 



Applying this is 



<p / fP ^exp 



k2-3 



dt. 



— 1 — — 1 

We now perform the change of variable t — Xp , no dt = -\p dX. We obtain: 



exp 



-cX^I-P 
k2-i 



dX. 



We recall that r(z) := t^ ^dt. Performing the change of variable i = , we have 



r(z) 



sp sp^ ' e ds — — 



PJo 



sp e ds. 



We now see that 



We then set s 



exp 
This yields 



\k2-j 

-cX^P 
k2-i 



A, and we have: 



\k2-i J Jq \k2-i J 2 \2J 



E 



rascyi\Re\Si]-mRe\SiW 

/C[fe] 



By Sterling's formula, r(z) < (f )^ Thus, T (f ) < {^) ^ . By arguing analo- 

gously for the imaginary parts, we obtain: 



E 



ia&yi\Si-mSi]\P 

/C[fe] 



where C is a positive constant. 



□ 
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Using the above lemma, we estimate E 



y2 



as follows. We let N' = 2™ be 



the smallest power of 2 which is > Nj. We then decompose [iVj] into a family of dyadic 
intervals. More precisely, we define T to be the family of intervals of the form 



((d- l)2^d2^], £e {0,1,...,™}, de{l, 



-yrn—i 



}■ 



Now, for any interval /', there are (at most) two intervals Ii,Ir E such that I' IiU Ir 
and |/; U /r| < 4|/'|. Moreover, for any partition V of [Nj], the number of times an / e 
is associated to a.n I' £ V is upper bounded by a constant. (This is as we have argued 
previously.) 

We let ri denote our probability space {uj £ ft corresponds to a specified value for each 
Zn). Now, for a fixed lo € fl, we say an interval / C is good if: 



max ISt' 

I'CI 



^Pi']\^ < I?2-^|/|lnln(iV), 



where Z? is a positive constant whose value we will specify later. Otherwise, we say / is bad. 
We let V denote the maximal partition (which depends on a;). For each interval I' (£V, we 
have (at most two) covering intervals /,-,/; € J-. We let J'-p denote the set of intervals in T 
which correspond to intervals in V (each I € J- corresponds to at most a constant number 
of intervals I' £ V). We have: 



E 

rev 



nel 



^ > max 

^ I'CI 

leJ^-P ~ 



E^" 



nCl' 



We observe that 



Emax 
I'CI 



I is good 



E^« 

ne/' 



E' 



D2-^Nj lnln(7V) < Inln(iV) ^ |cp 



cCC 



E- 



since each |cp is between 2 ^ ^ and 2 ^, and \C\ — Nj. To see this, note that for each /' 

m 



cec ' 



\Sr\' « \Sr-E[Si']\' + |E[5r]p, and \E[Sr]\' = 

It only remains to bound the contribution of the intervals that are not good. For this, 
we first prove the following lemma. For each interval I G we let B{I) denote the event 
that I is bad (i.e. not good), and we let denote its indicator function. 

Lemma 34. For each I £ F , 

when D is chosen to be a sufficiently large constant. 
Proof. By Chebyshev's inequality, for any p > 2 we have 



max I Sn 

I'CI 



E[Sp]\^ > i:'2-^|/|lnln(7V) 



[ma^rcilSr -E[S i,]\P] 
{D2-i\I\\n\n{N)) 



p/2 



(35) 

We now rely on the following result of Rosen [T7] . 

Lemma 35. (Theorem 4 in JiTf j Let Xi, . . . , Xk be samples drawn from a finite set of real 
numbers with replacement, and let Zi, . . . , Zk be samples drawn without replacement. Let 
1 < rii < < ■ ■ • < Um- For every convex, monotone function (/> : R — )■ M, we have 



E 



E^" 

\n=l 



E^" 



< E 




E^" 

\n=l 
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We want to apply this lemma to the function /(x) := |a;|P, but this is not monotone. 
Instead we define monotone, convex functions /i, /2 such that \x\p — fi{x) + /2(a;), namely 
setting fi{x) = {—x)p for x < and equal to otherwise, and f2{x) — x^ for x > and 
equal to otherwise. We note that \x\p > fi{x), f2{x) always holds. 

Without loss of generality, we consider / equal to the interval of length |/| starting at 1. 
Then, for some constant H, we have: 



E 



I'CI 



< i/P E 



max fiiRe{Sn-E[Sn])) 

l<n<|/| 



HP 



max /2 (/m (5„ - E[5„])) 

l<n<|/| 



Here, Sn denotes the partial sum of Zi + Z2 + ■ ■ ■ + Zn, Re denotes the real part, Im denotes 
the imaginary part, and there are four terms in this sum: one for each combination of /i, /2 
and real and imaginary parts. 

We can apply Lemma 1551 to each of these four terms to replace the samples Zi, . . . , 
taken without replacement with samples Xi, . . . , X\j\ taken with replacement. Now applying 
Lemma [331 we have 



#p|/|fpf2-JP/2 



/^'^(lnln(7V))i|/|i2-jW2 



-^j pf(lnln(7V))-i 



for some constant H. 

Now, setting p :— lnln(iV)/e, this is: 



H 
D 



In(iV)^ 



We can then set D large enough so that 



< e 



-4e 



and the lemma follows. 



We observe that the contribution of the bad intervals is upper bounded by 



□ 



Lb(7) max IS"// 



We next apply Holder's inequality with q, r fixed to be constants such that ^ 
- > 2,r > 1. We then have that the above quantity is: 



(36) 
— 1 and 



«E(E[W)])' 



By Lemma [Ml we know that 



E 



max I Si' 

I'CI 



\2r 



(E[1b(/)])' «(ln(A^))- 



We also know that for each /', |E[S'/']|2 < (^J^j lEcec^r < ^ lY.cec^l'^- When we 
sum these up over all I E we obtain <C In(Af) jX^cec '^l^ ■ ^'^^ multiplying by ln(iV)~^, 



in 



we obtain a contribution which is 



Thus, it only remains to bound 



leJ" 



(HN))-' > • ( E 



max I Si' 

I'CI 



'^[Si']\'^ 



Similarly to our above arguments, we define convex, monotone functions /i, /2 : K — >■ M 
such that fi{x) + f2{x) = \x\^^ . More precisely, we set fi{x) = (— x)^'' when a: < and 
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equal to otherwise, while we set f2{x) — x^"^ when x > and equal to otherwise. Now, 
again applying Lemma [551 it suffices to bound e.g. 



max h{Re(Sn-nSn])) 

l<n<|/| 



where Sn is now the partial sum Xi + • • • + where each Xk is a sample from C taken 
with replacement. (We must also bound the analogous quantities for other combinations of 
/i, /2 and Re, Im, but these will follow via the same argument.) 
We now apply Lemma to obtain that the above quantity is 



max \Re(Sn-E\SnW 

l<n<|/| 



<t:J2{E[\Re{Sj-E[Sj])ny . 



Next applying Lemma [27l we see that this is 



«E 



max < 



\i\ 



, n=l 



where X„ is defined to be an (independent, uniform) sample from C with replacement, 
recentered to be mean zero. In other words, X„ = Xn — EX„. Now, since r > 1, both of 
the quantities in this maximum are <C |/|2^-'. Hence, we have: 



«5]|/|2-^«ln(7V)^|cp. 



cec 



Multiplying this by our bound (In(A^)) ^ for the probability of each / being bad, we see 
that this is o (X)cec 1*^1^) ■ This completes the proof of Lemma [5^ 



Combining Lemma [32] with ([27|. we obtain Theorem [T] 



□ 
□ 



7 Refinements of Theorem [3] for Certain Structured ONS 

In this section, we briefly outline how Theorem|3]can be improved for more restrictive classes 
of ONS, using the methods employed in proving Theoremj^l We consider an ONS such that 
for / in the span of the system, we have < Cp||/||L2 for p > 2, where Cp is a constant 

depending only on p. Such systems arise naturally, for example, as the restriction of the 
trigonometric system to certain arithmetic subsets (A(p) sets). We will use the fact that a 
maximal form of this hypothesis can be obtained from a very general theorem of Christ and 
Kiselev j2j. 

Theorem 36. Let {4'n}'^=i be an ONS such that for f in the span of the system, we have 
II/IIlp < C'p||/||l2 for some p> 2. Then 

\\Mf\\L. <S:s Cp\\f\\L2 (37) 

as long as p > S > 2. 

This last condition implies that the implicit constant is uniform for large p. Using this 
and the arguments in the proof of Theorem ^ one can obtain the following; 

Theorem 37. Let {(/'nj^i be a ONS such that if f is in the span of the system, then 
||/||lp <^ Cp\\f\\L2 for some p > 2. We then have that 

||/|U.(y.)«p lni/^(|A|)||/|U.. 

where the coefficients of f are supported a finite index set A. 
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We briefly sketch the proof. We note that if ^ II/IIl^ holds, then this theorem 

follows for p = 2. However, this is in general not true and by the sharpness of Theorem 
[21 the best one can hope for in the general case is a factor of ln(|A|) in place of ln^^^(|A|). 
The proof follows the same setup as the proof of Theorem [HI We define a bad event for 
some interval J to be the event that \Sj\ > ln^/^(|v4|)(M( J))!/^ (j^gj.g j^^j-^ jg defined to 
be the sum of over n G J, where the o„'s are the coefficients of 0„ in the expansion of 
/). It is easy to see that the contribution from the good events are of an acceptable order 
and it suffices to bound the bad events. The argument is essentially the same as the proof 
of Theorem [5J with the exception that we use the following estimate: 

/ r \ l/(p/2)' / r \ (2/p) 

(Here, (p/2)' denotes the conjugate exponent of p/2.) 

We now estimate \Sj\p « (/^ \Sj\Y' « C^p {M{J)y'\ Hence (/^ \~SjY) « 
CpM{J). Next, by Chebyshev's inequality. 



Hence (using l/{p/2)' ^ we have (^/t 1b(J) j < in(p-^)/p(|A|) " ™^ ^^^^^^^ 



T \JT 



Now we sum this quantity over ln(|A|) levels, each with the sum of M{J) summing 
to 1. Hence the contribution from the bad events to the quantity we wish to estimate is 
0(ln^^^(|^|)). This is exactly the order we wish to show. 

Finally, we observe that: 

Theorem 38. Let {(t>n}'^:=i be an ONS such that if f is in the span of the system, then 
\lp « VPll/lli^ all p > 2). Then 



\l^^v-) « v/lnln(|v4|)||/|U2, 

where the coefficients of f are supported on the index set A. 

This is proved using the same arguments sketched for the previous theorem, however 
now we have freedom to optimize over the choice of p we use. The optimum occurs with a 
choice of p about ce~^ lnln(A'^). Essentially the same argument is given in detail in the proof 
of Theorem [7] for random permutations (see the proof of Lemma I34p . Here it is important 
that the constants in the Christ-Kiselev theorem are uniformly bounded for large p. 

The above theorem can be applied to systems formed by Sidon subsets of the trigono- 
metric system, since the hypothesis of this theorem characterizes Sidon sets (when applied 
to subsets of the trigonometric system) by a theorem of Pisier [15] (see also [H]). 

8 Variational Estimates for the Operator 
8.1 Notation 

Let r : M — )• be a convex symmetric function, increasing on and tending to infinity 
at infinity such that r(0) = 0. Then the Orlicz space norm associated to F is defined as 

min <! A : / F I 4^ \ dx < I 



T 
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The fact that this norm satisfies the triangle inequahty is an easy exercise using Jensen's 
inequahty. We refer the reader to [TU] for the general theory of these spaces. Following [T], 
we will be interested in T := Tk defined as follows 



\t\<K 



Later we will also use 



/ |t|l/^ \t\<K 



We note that t'^jxit) < ^K{t) for all t. We state some other basic properties that we 
will need. 

Lemma 39. Let2=p. Then \\ ■ \\r is p- convex. That is, for any functions fi, fk from 
T to R, 



i/p 



i/p 



< 



Proof. Let TK,i/p{t) := Txit^^^), which we observe is still convex (we have used that p — 2 
here). Since ^K,i/pit) is convex, we can use it to form an Orlicz space norm. We observe 
that 



min < A : / Tk 



A 



= min< A : / Fj^ i/p 



dx<l} = 



i/p 



1=1 
i/p 



dx < 1 



i/p 



< 



El 

^1=1 



Ell/' 



p 



The inequality here follows from the triangle inequality for 



□ 



8.2 Proof of Theorem [TO] 

We now prove: 

Theorem 10. Let p > 2 and {4'n}n=i '^'^ orthonormal system such that ||0„||loo < C 
for all n. There exists a permutation n : [N] [N] such that the orthonormal system 
{ipn ■= 07r(n)}^Li Satisfies 



:HVp) <c,p lnln(iV)||/||z,2 



(38) 



for all f = J2n=l anil^nix). 

Our starting point is the inequality (3.21) of [I]: 

Theorem 40. Let {(j)n}n=i ^'^ orthonormal system with \ \(f>n\\L°° < C for all n. Then 
there exists a permutation tt : [N] — > [N] such that for all subintervals I of [N] and all real 
values ai, . . . , , the orthonormal system {-011 := <t>'K(n)}n=i satisfies: 



E O-ni^n 



1/2 



«c ln'/\N) I E «n 



(39) 



We will need a variational form of this inequality. This is easily achieved using a 
Rademacher-Menshov argument. 
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Lemma 41. With the notation as above, we have that 



i^n}ne/||y2 



- «/ 



II \nel ) 



1/2 



(40) 



for all I C [N] and all real sequences ai, . . . , gn- 



Proof. As in section [31 we assume (without loss of generality) that I = [2^] for some i and 
we define the intervals Ik,i := ik2\ {k + 1)2'] for < j < ^ and < A: < 2^"* - 1. For each 
J Q I, we can express J as a disjoint union of intervals Ik.i, where the union contains at 
most two intervals of each size. As in (O, we then observe for each x G T: 



[a„^^^„}„(ZI\\v2{x) < 

1=0 \ 



2'- 



fc=0 \nelk,i 

By the triangle inequality for the Orlicz norm, we then have 



[a.n''Pn}nel\\v^ 



N/\Il i=0 



2'- 



\ fc=0 \nelk.. 



N/\I\ 



Applying Lemma l39l this is 

i 

z=0 

By Theorem l40l we obtain 



2«->-l 



E 

^ k=0 



N/\I\ 



2«- 



Y: E «^. = in^/'wEjE«^-i-'''wjE«^- 



/i;=0 nGlk 



n6/ 



□ 



We now prove Theorem [TUl We assume (without loss of generality) that X]!^=i ^-n — 1- 
As in Section [51 we consider decomposing [N] into a family of subintervals according to 
mass, defined with respect to the a„'s. We recall that the mass of an arbitrary subinterval 
/ is defined to be M{I) := ^, 



nel ' 



We define the intervals Ik,s for 1 < s < 2 and points 
ik,s as in Section [51 We refer to the intervals Ik,s for 1 < s < 2*^ as the admissible intervals 
on level fc, and the points ik.s (as s ranges) as the admissible points on level k. We note that 
any interval / C [N] can be expressed as a union of intervals of the form Ik^s and points 
ik,sj where there are at most two intervals and two points for each value of k (this follows 
analogously to the proof of Lemma [T^ . This decomposition is obtained by first taking the 
intervals Ik^s and points ik,s contained in / with the smallest value of k. (There are at most 
2 of each, otherwise / would contain an admissible interval or point for a smaller k value.) 
These "components" of / on level k form an interval, and when we remove this from /, we 
are left with a left part and a right part. Each part can then be decomposed as union of 
intervals Ik,s and points ik,s for higher values of fc, and each of the two unions contains at 
most one interval and one point on each level. 

We let TT : [N] — > [A^] be the permutation as in Lemma HTl and ipn ■= (t>TT{n)- We fix an 
a; S T. The value of 

||{anV'n(2^)}^=l||yp 
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is achieved by some partition V of [N] . Eacli I E V can be expressed as a union of intervals 
of the form I^^s and points ik,s, and we denote the set of these intervals and points by T/ 
and tj respectively. We recall that each of Tj and tj will have at most two intervals or points 
(respectively) on each level. We also note that each admissible interval will appear in this 
union for at most one I € V. ^ 

We fix a positive constant c (depending on p) such that c > max{^ ^ p) ' ^} i^^^^ 
is possible because p > 2). We define k* :— clnln(A^) (more precisely, k* is the nearest 
integer greater than clnln(A^)). Now, for each / e 7^, all of the intervals in Tj and points in 
tj on levels greater than k* are contained in the two intervals Ik',se and Ik*,sr on level k* , 
where se is one less than the s value for the leftmost interval Ik*,s in Tj, and Sr is one more 
than the s value for the rightmost interval Ik*,s in Tj. We will use k* as a cutoff threshold: 
we handle the intervals and points at levels < k* directly and handle the intervals and points 
at levels > k* using the fact that they are contained in Ik* .se, Ik* ,sr-- We define Tj to be the 
subset of intervals in Tj on levels < k* and t'j to be the subset of points in tj on levels < k* 
Now, 



\{an'4'nix)}n=i\\yp is equal to: 



Kiev \nei 



Xa„V«( 




p\ i/p 



y^ y^ a„'(/'„(a;) + ^ a„'(/'„(2;) + a„'(/'n(a;) + Y "^"^^ 
iev\jeT;neJ JeTi\T;neJ net'^ neti\t', 




Applying the triangle inequality for the £p-norm, this is: 



i/p 



lev \net'j 

p\ i/p 



\iev \jeTi\T;neJ neti\t'j J J 

We consider the second of these three terms. Since p > 2, we have 



(41) 



uev Knet'j 




1/2 



.lev \net', 



For each fc < fc*, we let denote the set of admissible points on level k. Since each t'j 
contains at most 2 points in each £k, we can apply the triangle inequality to obtain 

( I \^\ fe* 1/2 

X X^"^"(^) «X X(""^»(^))') ■ 

^ev \net'j j J fe=o Xnetk / 

Now, by the triangle inequality for the norm and the fact that Jj a'^ip'^{x)dx = 

X X(°"^»(^))' «plnln(7V). 

k=0 \ne£k I jji 

To see this, recall that 'Y^=\ ^ ~ so 'Yl,netk ^n — "^ for each fc, and k* Inln(iV). 

It remains to bound the first and third terms in ([1T|) . We consider the first term. For 
each k, we let Ck denote the set of admissible intervals Ik,s as s ranges from 1 to 2*^ (i.e. 



for all n, we have 
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the admissible intervals on level k). Then, by triangle inequality for the norm and the 
fact that p>2, 



p\ i/p 



1/2 



lev \jeT; nej 



k* 



2\ 1/2 

<E|EI E E«"'/'«(^) 

k=o \iev yjeT^nCkneJ 

^ 2\ 1/2 

< E I E ( E'*"'^"(^^ 

fc=o \JeCk \neJ 

Now, using the triangle inequality for the || • 11^2 norm, we have: 



k* 



1/2 



E E E""^»(^) 



fe=o \,ieCk \neJ 



L2 



k* 

^E 

*:=0 



1/2 



E E ""V'n(a:) 



= E(E / fE-»^"(-)) 

fc=0 V .7e£fc "^^ ViGJ / 



1/2 



L2 



dx 



k* 



1/2 



5] X M(J) «plnln(iV), 



fc=o \JeCk / 

since X^jec^ M{J) = 1 for each fc, and fc* «Cp Inln(iV). 

We are thus left with the third term of (|^T|). For each I G V, we consider the union of 
the intervals and points in Ti\Tj and tj\t'j. This can alternatively be described as a union 
of at most two intervals Ji and Jr, where each of Jg, Jr is a subinterval of Ik' .s for some s. 
To see this, recall that / is decomposed into a union of admissible intervals and points by 
taking the admissible intervals and points contained in / for the earliest level where this set 
is non-empty. The remaining left and right parts of / are then decomposed separately. If 
the minimal fc is < fc*, then Ji is the union of the intervals/points in the decomposition of 
the left part that fall beyond level fc*, and Jr is the same for the right part. If the minimal 
fc is > fc*, then in fact all of / is contained in some admissible interval on level fc*, and we 
can take Jg to be this interval and Jr to be empty. We then rewrite the quantity we wish 
to bound as: 

p\ i/p 



E E O.ni'nix) + X ani^nix) 



Applying the simple fact that (a + hy < 2'P{a'P + 6^) for all non- negative real numbers a and 
6, we see this is 



« E E «"V'n(a;) J + 51 

\leV \neJe I \neJr 



p\ 1/p 



Now we observe that we are summing the values a„?/'„(x) over disjoint intervals, each of 
which is contained in Ik',s for some s. Thus, this quantity is upper bounded by: 



i/p 



- E l|{«"V'n(a;)}ne7fc.,, 



I VP 



, l<s<2'=* 
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Therefore, it suffices to bound 



i/p 



. l<s<2'' 



For each s from f to 2^ , we define disjoint sets Gs,Bs such that Gs U Bs = T. We 
define to be x e T such that \\{an^nix)}neh. J\vp < 2-'=^"1"(^)/p and B, to be the 
complement. By two applications of the triangle inequality (first in the £p norm and then 
in the norm), we have 



i/p 



\\{ani^n{x)}n€l^,J\vP 



L2 



^fGj|{a„Vn(a;)}„e4. 



, s=l 




1/p 



L2 



fsj|{a„'(/'n(a;)}n64 




1/p 



L2 



Using that ||{a„'0„(x)}„g/j., J|yp <C 2 for x £ Gs, we have that the first term 

is 0(f) (from the fact that there are at most 2'^'"'"(^) terms in the sum). We now estimate 



i/p 



Y '^BAx)\\{a.nTl'n{x)}neh' ,s\\vp 



L2 



1/2 



Iy2 



L2 



1/2 



^ H l|ls,(a;)ll{a„V'«(a:)}ne4 



(42) 



where Bs is the set of x e T such that ||{o„Vn(2;)}ne/fc. Jly^ > 2 cinin(Af)/p^ ^^j^^j have 
used the fact that Bs ^ Bs- 

We now consider two cases. First, we consider the set S'big of s values where |/fe->,s| > 
^2-''inin(^)^ Clearly, there can be at most 2^'"'"(^) such intervals. Now we bound the 
contribution to (|^^ above from these big intervals as 



1/2 



1/2 



Y II1b,(^)II{«»'^»(^)}»64-. 



|y2||^2 



, seSbig 



lL2(y2) 



Recalling that ||{a„-0„(x)}„g/^. Mh^v^) < ln^(iV)2-=i"'"(^) (fromLemma[Tl since M(/fc.,s) < 
2~'^ for all s) and that there are at most 2^'"'"^^' values of s e S'big, we have that the 
above is 

« (2^1nln(iV)lj^2(^)2-^l"l"(^))'^' « f. 

Here we have used that 9 < c. ft now suffices to consider the values of s such that s| < 

^2-71nln(N)^ 

We define 7* = 727inin(jv). For any real numbers e > 0, A > 1, and a > e, we have 
> I. We set e := 2-=1'i1"(^)/p. Now, for all x G Bs, we have: 



{a„?/'«(2^)}«6/fc._J|y2 < \ \{an'lpn{x)}neh. 



,2 7*(-^ ^\\{0'ni>n{x)}nelk' MV^) 



k* ,s M y2 



7*(A-ie) 



(43) 
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We recall that M{Ik',s) < 2-^'"'"(^) for each s. Analogously to 7*, we define := 
In !„(«). Now, for any A > 1: 

||{a„i/'n(a:)}„6/fc.,J|yp dx < \^ J 7* (^-^^ r*(A"^||{a„V«(a;)}„e/^._J|v2)da;. 

This follows from P5)) and the definitions of 7* and (recall also that t^7*(t) < r*(i) for 
aU t). 

Since > 2^'"i"(^) and the value of || • ||r^ increases as K increases, we can apply 

Lemma SI] to obtain 



1/2 

,2 



for all s such that l^.^^l < N2-^ inin(Ar)^ ^^^^^ jj 

is some fixed constant (depending on C). 
We see that for A :^ j:>ln^/'^(jV)2~ '"" 2"'"' , wehave /^r*(A-i||{a„V'„(a;)}„e/fc..J|yO'^a; < 



1. Therefore: 



|{a„V'n(x)}„e4.J|J,,da;«ln^/'(A^)2-='"'"W7*(^) (44) 



We consider the quantity 7* (j) . We observe: 

- = (£)-i)2i"i"(^)(-'=/P+=/2-V4)_ (45) 
A 

Now, if dH]) is > 27^"i"(^), we wih have 

7* (l)"' = 2-7/2i"i"(A^). 
If gH) is < 2^i"i"(^), we will have 

^1^"^ = ^l/22lnln(JV)(7/8-c/4+c/2p)^ 

We note that | — f + ^ < — |, because c{^ — ^ > x- Thus, in either case, 

7*(^)"'«c2-^/2'"^"W. 
Inserting this into pij) . we find that 

||{a„V„(x)}„e/.. jr^, «C lnV2(^)2-cl„lnW2-7/21„l„(A.) ^-c^n^niN) ^ 

Now to bound (|^^ . we apply this to each of the < 2'^'"'"(^) terms, yielding 0(1), completing 
the proof. 
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